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Abstract — A  number  of  network  applications  require  stable 
transport  throughput  for  tasks  such  as  control  and  coordination 
operations  over  wide-area  networks.  We  present  a  window-based 
method  that  achieves  stable  throughput  at  a  target  level  by 
utilizing  a  variation  of  the  classical  Robbins-Monro  stochastic 
approximation  algorithm.  We  analytically  show  the  stability  of 
this  method  under  very  mild  conditions  on  the  network,  which  are 
justified  by  Internet  measurements.  Our  User  Datagram  Protocol 
(UDP)-based  implementation  provides  stable  throughput  over  the 
Internet  under  various  traffic  conditions. 

Index  Terms — Rohbins-Monro  algorithm,  stochastic  approxi¬ 
mation,  transport  stahilization. 


I.  Introduction 

A  number  of  next  generation  network-based  applications 
require  stable  throughput  for  control  of  processes  over 
wide-area  networks.  Examples  include  controlling  mobile 
robots  remotely  over  wide-area  networks,  computational 
steering  of  remote  large  scientific  simulations,  and  coordinated 
visualization  of  distributed  datasets.  Typically,  the  bandwidth 
requirements  are  modest  in  these  applications,  often  requiring 
only  a  small  fraction  of  the  available  bandwidth.  But,  it  is 
extremely  important  that  the  throughput  rate  at  remote  site(s) 
be  stable  in  presence  of  dynamic  traffic  conditions.  For  ex¬ 
ample,  large  amount  of  jitter  in  throughput  can  destabilize  the 
control  loops  needed  for  remote  robots,  possibly  causing  severe 
damage  to  them. 

The  widely  deployed  Transmission  Control  Protocol  (TCP) 
is  not  designed  for  providing  such  stable  throughput  for  several 
reasons.  First,  it  continues  to  increase  its  throughput  until 
losses  are  encountered  which  often  results  in  much  higher 
throughput  than  needed,  especially  in  over-provisioned  net¬ 
works.  Second,  perhaps  more  importantly,  TCP  drastically 
reduces  its  throughput  to  levels  significantly  below  the  required 
in  response  to  bursty  losses.  While  TCP  throughput  can  be  cur¬ 
tailed  around  the  desired  values  by  clipping  the  flow  windows, 
its  nonlinear  dynamics  make  this  task  very  challenging  [6]; 
in  fact,  TCP  is  provably  chaotic  under  certain  conditions  [5], 
which  makes  the  throughput  stabilization  particularly  difficult. 
Even  if  such  approach  is  successful,  TCP  offers  no  protection 
against  the  “underflow”  particularly  in  high  traffic  networks. 
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We  show  that  a  dynamic  version  of  the  classical  Robbins- 
Monro  method  [3],  [7]  offers  a  provably  stable  throughput  under 
very  general  conditions  that  can  be  justified  using  Internet  mea¬ 
surements.  Our  implementation  based  on  User  Datagram  Pro¬ 
tocol  (UDP)  achieved  very  stable  and  robust  throughput  over 
the  Internet  under  various  traffic  conditions. 

II.  Problem  Formulation 

We  consider  the  problem  of  stabilizing  a  transport  stream 
from  a  source  node  S'  to  a  destination  node  D  over  a  wide-area 
network,  typically  the  Internet.  The  objective  is  to  achieve  a 
target  throughput  rate  7  at  _D  by  dynamically  adjusting  the 
sending  rate  at  S  in  response  to  network  conditions.  Packets 
are  sent  from  S  and  are  acknowledged  by  D.  Both  packets 
and  their  acknowledgments  can  be  delayed  or  lost  during 
the  transmission  due  to  a  variety  of  reasons,  including  buffer 
occupancy  levels  at  routers  and  hosts,  and  link  level  losses.  Fet 
Rs{t)  and  Gnit)  denote  the  sending  rate  at  S  and  throughput 
or  goodput  at  D,  respectively.  Each  rate  is  given  by  the  number 
of  packets  sent  and  received  at  S  and  D,  respectively,  divided 
by  the  interval  duration.  The  response  plot  corresponds  to 
values  of  Gd(.)  plotted  against  Rs{-)-  Typically,  in  wide-area 
transports,  Gni  )  increases  with  Rs(-)  in  an  overall  sense  for 
lower  values,  often  incurring  very  low  losses.  For  higher  values 
of  Rs(-),  there  is  an  increase  in  losses  and  Gd(-)  decreases 
in  an  overall  sense.  Such  overall  behavior  is  well-known  and 
has  been  ideally  modeled  as  fixed  smooth  concave  utility 
functions  in  a  number  of  transport  control  works  [2],  [4];  under 
these  assumptions,  the  stabilization  problem  involves  simply 
computing  Rs  value  such  that  Gnit)  =  7  (if  such  utility 
functions  are  known  precisely). 

In  practical  wide-area  networks,  however,  such  precise  char¬ 
acterization  is  not  feasible.  In  most  cases,  one  only  has  access 
to  the  measurements  at  source  (including  the  ones  sent  by  des¬ 
tination)  to  adjust  Rs(t).  The  measurements  shown  in  Fig.  1 
are  collected  between  Oak  Ridge  National  Faboratory  (ORNF), 
Oak  Ridge,  TN,  and  Fouisiana  State  University  (FSU),  Baton 
Rouge,  FA.  The  datagrams  within  a  window  are  all  sent  continu¬ 
ously  followed  by  a  waiting  period,  and  this  process  is  repeated. 
In  the  horizontal  plane  each  point  corresponds  to  window-size 
and  waiting-time  (or  idle-time)  pair,  the  ratio  of  which  speci¬ 
fies  the  top  and  bottom  plots  represent  Gn{t)  and  loss 

rate,  respectively.  For  illustration,  let  us  fix  the  waiting-time  and 
increase  the  window-size  which  corresponds  to  taking  vertical 
slices  of  the  plots  parallel  to  the  window-size  axis.  There  are 
two  important  features:  1)  there  is  an  overall  trend  of  increase 
followed  by  decrease  in  Gd  as  Rs  is  increased;  this  overall  be¬ 
havior  is  quite  stable  although  the  transition  points  vary  over 
time  and  2)  the  plot  is  quite  nonsmooth  mostly  because  of  the 
randomness  involved  in  packet  delays  and  losses;  derivation  of 
smooth  utility  functions  from  the  response  plots  is  inherently 
approximate  and  requires  a  large  number  of  observations  in  a 
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well-known  stochastic  approximation  (SA)  algorithm  [3].  Intu¬ 
itively,  Wi  Is  Increased  If  the  estimate  of  throughput  Is  below  7 
and  decreased  otherwise.  Initially  Wo  Is  chosen  based  on  mea¬ 
surements  so  thati?5(io)  Is  In  the  vicinity  of  7. 

We  compute  the  acknowledgment  rate  As{t)  with  every 
acknowledgment  received  at  S  by  dividing  the  total  number 
of  received  acknowledgments  by  the  time  expired  so  far.  Thus 
Rs{t)  -  As{t)  Is  the  sum  of  loss  rate  of  packets  from  Sto  D 
and  acknowledgments  from  D  to  S.  We  assume  a  symmetric 
loss  process  so  that  the  loss  rate  of  packets  and  that  of  acknowl¬ 
edgments  are  Identically  given  by  [Rs{t)  -  As{t)]/2.  Then 
we  employ  the  estimate 


Gs{t)  =  As(t)  + 


Rs{t)  —  As(t) 
2 


As(t)  +  Rs(t) 
2 


Fig.  1.  Internet  measurements  with  sending  rate  along  horizontal  plane,  and 
throughput  and  loss  rate  along  vertical  axes  in  top  and  bottom  plots,  respectively. 

preprocessing  step.  In  practice,  S  computes  an  approximation 
Gs{t)  to  Goit)  typically  based  on  acknowledgments. 

We  assume  that  7  Is  specified  much  below  the  peak  of  the  re¬ 
sponse  plot,  where  theloss  ratelsvery  small  (If  any)  as  shown  In 
Fig.  1.  Due  to  a  wide  variety  of  factors  such  as  competing  traffic, 
buffer  occupancy  at  hosts  and  routers,  and  packet  losses,  Goit) 
has  a  significant  random  component  In  addition  to  Its  overall 
dependence  on  Rs{.).  Furthermore,  S  does  not  have  access  to 
Goit)  and  It’s  estimates  are  communicated  over  the  network, 
either  directly  or  Indirectly  as  part  of  acknowledgments.  Conse¬ 
quently,  Gs{t)  Is  a  further  “randomized”  version  of  Goit  -  S), 
for  some  ^  >  0. 

Let  r(r)  be  the  response  regression  given  by  the  expected 
value  of  Gd  corresponding  to  fixed  Rsit)  =  r,  that  Is 


If  i?s(.)  Is  fixed,  the  symmetric  loss  condition  yields  the  flow 
equation  ^;[i?s(t)]  =  ^;[Gi5(i)  +  (i?s(i)-is(i))/2]  orequlv- 
alently  i?[i?s(f)]  =  2E[GD{t)\  -  E[As(t)].  Then,  using  the 
above  equation.  It  follows  that  Gs  Is  a  conditionally  consistent 
estimator  of  Gd,  namely 


E 


GsWIGttW  =  a: 


=  X. 


For  the  nonsymmetric  loss  case,  the  acknowledgments  can  be 
augmented  with  the  one-way  loss  estimator  from  D,  which  can 
be  used  to  obtain  a  consistent  estimator  of  Gn- 
We  now  show  the  stability  of  (3.1)  using  Theorem  2  from  [1], 
namely  E[(Rs{ti)  -  ^  0  as  i  ^  00.  To  show  this  (3.1) 

can  be  rewritten  as 


Wi+i  =  Wi 


■  aj 


ys{ti)  -  lira  + 


(3.2) 


^;[Gyy(i)|i?s(i)  =  r]  =  r(r). 

Let  the  stabilization  rate  be  given  by  r(r.y)  =  7.  FI  ere  7  Is 
chosen  such  that  Is  within  the  Initial  Increasing  part  of  r(.). 
We  assume  that  r(r)  Is  locally  monotonic  In  the  neighborhood 
of  7  such  that:  r(r)  >  7  for  r  >  and  r(r)  <  7  for  r  <  r-y. 
This  assumption  Is  consistent  with  the  measurements  In  Fig.  1 
and,  also  with  the  concavity  assumptions  In  [2],  [4].  Note  that 
Gd(.)  Is  typically  nonsmooth  and  r(.)  Is  not  known. 


where  wedenoteXsW  =  Gs(i)(ra+T.^,)  and  =  k/E.lhen 
we  have 


E 


Ys{t,)\Wi,W2,...,W,  ={Ta+T^)T 


W- 


Ta+Tu 


We  assume  that 


var 


Ys{ti)\Wi,W2,...,Wi 


< 


(3.3) 


III.  Throughput  Stabilization 

In  our  method  Rs(t)  Is  adjusted  at  each  tlmei^  an  acknowl¬ 
edgment  Is  received,  by  using  an  estimate  Gs{t)  of  Goit).  At 
timeii,  \eti\he  window-size  Wi  denote  the  number  of  packets  to 
be  sent  followed  by  a  waiting-time  such  that  for  u  <  t  < 


Rs{t) 


Wi 

'fa  +  'f-w 


where  Is  the  time  needed  for  transmitting  the  packets  at  S. 
Att  =  ti+i,  the  window  size  Is  updated  as  follows: 


W,+i  =  Wi-pi  Gs(ti)-j 


(3.1) 


where  pi  =  (k(Ta  +  t^)IE)  for  0.5  <  a  <  1  and  fc  >  0  a 
suitably  chosen  constant.  This  method  Is  a  specific  form  of  the 


for  some  (T,  which  can  be  easily  ensured  by  thresholding  the  Wi 
values.  We  assume  there  exist  fco  and  ki  such  that 

ko\r  —  r-j\  <  |r(r)  —  7I  <  fcijr  —  r-^\  (3.4) 

which  Is  justified  by  the  measurements  at  low  Rs{.)  values. 
Let  =  r-f{Ta  +  T-ui)  correspond  to  the  Ideal  window  size 
that  achieves  the  stabilization  rate7lnan  average  sense.  Fora 
constant  value  of  LCy,  we  Invoke  Theorem  2  of  [1]  for  the  case 
It  Is  slowly  varying:  note  that  ujy  Is  an  average  quantity,  and 
the  measured  throughput  levels  for  this  window  could  be  quite 
varied.  Then  from  [1]  we  have  the  following  stability  result: 

Theorem  1:  Under  the  locally  monotonIc  response  re¬ 
gression,  symmetric  loss  and  conditions  In  (3.3)  and  (3.4), 
the  window-sizes  computed  In  (3.2)  satisfy  the  stability 
conditions  E[(Wi  -  or  equivalently 

E[(Rs(ti)  -  r-^f]  =  0(r“). 
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Fig.  2.  Stabilization  at  1.5  M  b/s. 


Since  Gs{t)  is  a  random  quantity,  it  is  very  criticai  that  the 
step  size  pi  in  (3.1)  be  chosen  to  satisfy  the  ciassicai  Robbins- 
M  onro  conditions  [7]:  (i)  ^  0  as7  ^  oo,  (ii)  =  oo, 

and  (iii)  particuiar,  a  fixed-step  size  (e,g, 

used  in  [4]  for  smooth  deterministic  case)  does  not  guarantee 
stabiiity,  and  in  fact  did  not  stabiiize  in  our  experiments  over 
the  I  nternet. 

The  above  stabiiity  anaiysis  remains  vaiid  for  the  case 
window-size  is  fixed  and  waiting-time  is  adapted  in  a  manner 
simiiar  to  (3.1).  Our  experimentai  resuits  are  quaiitativeiy 
identicai  in  both  these  cases. 

For  a  fixed  7,  changes  in  r(.)  wiii  resuit  in  different  (or 
Lo^)  vaiues.  Theaigorithm  in  (3.1)  is  very  robust  in  presence  of 
changes  in  r-y.  Changes  in  network  traffic  (for  exampie  due  to 
new  connections and/ortermination  of  oid  ones),  typicaiiy  have 
very  smaii  effect  on  the  regression  function  r(.).  In  particuiar, 
the  effect  of  individuai  traffic  streams  is  mitigated,  resuiting  in 
very  smaii  changes  in  r-y  in  the  iow  ioss  region.  Indeed,  the 
results  of  [1]  are  valid  when  r-y  is  time  varying.  If  rj{ti+i)  - 
r-y{ti)  =  theaigorithm  in  (3.1)  stabilizes  as: 


E[{Rs{ti)-r^f 


ifw>§Q! 
O  (7-[2(“-“)])  ifw<§Q!. 


IV.  EXPERIM  ENTAL  RESULTS 

Our  method  is  implemented  using  UDP,  and  tested  exten¬ 
sively  between  ORNL  and  LSU.  ORNL  is  connected  to  ESnet 
which  peers  with  Abilene  network  in  New  York  (before  recent 
upgrade).  Abilene  runs  from  New  York  via  Washington,  DC, 
andAtlanta,  GA,to  Flouston,TX,  where  it  connects  to  LSU  via 
a  regional  network.  In  terms  of  network  distance,  these  two  sites 
are  separated  by  more  than  two  thousand  miles,  and  both  ESnet 
and  Abilene  have  significant  traffic.  Fig.  2  showstypical  results 
for  target  throughput  at  1.5  and  3.0  M  b/s  all  below  the  peak 
bandwidth  but  above  throughput  1.09  M  b/s  achieved  by  default 
TCP.  In  each  plot,  top  and  bottom  curves  correspond  to  Rs(t) 


Fig.  3.  stabilization  at  2.5  M  b/s  throughput  under  background  large  file  ftp  at 
host  with  web  browsing. 


and  Goit),  respectively,  which  often  overlap  indicating  low  loss 
conditions.  The  stabilization  typically  occurred  under  very  low 
albeit  nonzero  packet  loss.  The  throughput  was  remarkably  ro¬ 
bust  and  was  virtually  unchanged  when  ftp  transfers  of  various 
filesizes  were  made  atthelocal  LAN  hosttogether  with  various 
web  browsing  operations  as  shown  in  Fig.  3. 

V.  Conclusions 

This  letter  establishes  that  the  classical  statistical  control 
methods  provide  provably  robust  throughput  stabilization 
over  wide-area  networks,  and  our  implementations  support 
these  results  over  the  Internet.  Flowever,  in  practice  the  target 
throughput  is  meant  only  to  be  a  small  fraction  of  available 
bandwidth  so  as  to  minimally  affect  the  other  traffic  streams.  In 
a  certain  sense,  this  approach  is  a  new  way  of  designing  stable 
transport  protocols  wherein  the  inherent  randomness  in  the 
measurements  is  handled  by  the  SA  algorithm.  Topics  of  future 
interest  include  throughput  maximization  using  SA  algorithms 
(such  as  Kiefer-Wolfowitz  or  simultaneous  perturbation  [3]) 
and  their  ability  to  be  fair  to  other  traffic  streams  in  the  network. 
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